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METHOD FOR GENERATING LIBRARIES OF ANTIBODY GENES 
COMPRISING AMPLIFICATION OF DIVERSE ANTIBODY DNAs 
AND METHODS FOR USING THESE LIBRARIES FOR THE 
PRODUCTION OF DIVERSE ANTIGEN COMBINING MOLECULES 
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Background of the Invention 

Monoclonal and polyclonal antibodies are useful for a variety of purposes. The precise 
antigen specificity of antibodies makes them powerful tools that can be used for the detection, 
quantitation, purification and neutralization of antigens. 

Polyclonal antibodies are produced in vivo by immunizing animals, such as rabbits and 
goats, with antigens, bleeding the animals and isolating polyclonal antibody molecules from the 
blood. Monoclonal antibodies are produced by hybridoma cells, which are made by fusing, in 
vitro, immortal plasmacytoma cells with antibody producing cells (Kohler, G. and C. Milstein, 
Nature, 256:495 (1975)) obtained from animals immunized in vivo with antigen. 

Current methods for producing polyclonal and monoclonal antibodies are limited by 
several factors. First, methods for producing either polyclonal or monoclonal antibodies require 
an in vivo immunization step. This can be time consuming and require large amounts of antigen. 
Second, the repertoire of antibodies expressed in vivo is restricted by physiological processes, 
such as those which mediate self-tolerance that disable auto-reactive B cells (Goodnow, C. C., et 
al, Nature, 334:676 
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(1988); Goodnow, J.W., Ba s ic_and_Cl inic a 1_I mmunology., £d • 
5, Los Altos, CA, Large Medical Publications (1984); 
Young, C . R . , Mo 1 e cu 1 a r_I mnuno 1 o^i , New York, Marcel 
Dekker (1984)). Third, although antibodies can exist in 

05 millions of different forms, each with its own unique 
binding site for antigen, antibody diversity is 
restricted by genetic mechanisms for generating antibody 
diversity (Honjo, T., Ann^_R ev^_ Immunol ^ , 1:499 (1983); 
Tonegawa, S., H a t u r e^ 3 0 2 : 5 7 5 (1983)). Fourth, not all 

10 the antibody molecules which can be generated will be 

generated in a given animal. As a result, raising high 
affinity antibodies to a given antigen can be very time 
consuming and can often fail. Fifth, the production of 
human antibodies of desired specificity is very 

15 problematical. 

A method of producing antibodies which avoids the 
limitations of present ly- available methods, such as the 
requirement for immunization of an animal and in_vivo 
steps, would be very useful, particularly if it made it 

20 possible to produce a wider range of antibody types than 
can be made using presently-available techniques and if 
it made it possible to produce human antibody types. 

The present invention relates to a method of produc- 
25 ing libraries of genes encoding antigen- combining 

molecules or antibodies; a method of producing- antigen- > 
combining molecules, also referred to as antibodies, 
which does not require an in_vivo procedure, as is 
required by presently- available methods; a method of 
30 obtaining ant igen- combining molecules (antibodies) of 
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selected or defined specificity which does not require an 
in_vivo procedure; vectors useful in the present method 
and antibodies produced or obtained by the method. 

The present invention relates to an in_vitro process 
05 for synthesizing DN A encoding families of antigen- 
combining molecules or proteins. In this process, DNA 
containing genes encoding antigen-combining molecules is 
obtained and combined with oligonucleotides which are 
homologous to regions of the genes which are conserved. 
10 Sequence-specific gene amplification is then carried out 
using the DNA containing genes encoding antigen- combining 
proteins as template and the homologous oligonucleotides 
as primers . 

This invention also relates to a method of creating 

15 diverse libraries of DNAs encoding families of antigen- 
combining proteins by cloning the product of the in_vitro 
process for synthesizing DNA , described in the preceeding 
paragraph, into an appropriate vector (e.g., a plasmid, 
viral or retroviral vector). 

20 The subject invention provides an alternative method 

for the production of antigen-combining molecules, which 
are useful affinity reagents for the detection and 
neutralization of antigens and the delivery of molecules 
to antigenic sites. The claimed method differs from 

25 production of polyclonal antibody molecules derived by 

immunization of live animals and from production of mono- 
clonal antibody molecules through the use of hybridoma 
cell lines in that it does not require an in_vivo 
immunization step, as do presently available methods. 

30 Rather, diverse libraries of genes which encode- antigen- 
ycombining sites comprising a significant proportion of an 
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animal's repertoire of antibody combining sites are made 
as described in detail herein. These genes are expressed 
in living cells, from which molecules of desired 
antigenic selectivity can be isolated and purified for 

various uses . 

Antigen-combining molecules are produced by the 

present method in the ^following manner, which is 
described in greater deail below. Initially, 
of antibody genes which includes a set of variable 
regions encoding a large, diverse and random group of 
specificities derived from animal or human immunoglob- 
ulins is produced by amplifying or cloning diverse 
genomic fragments or cDNAs of an-t.ib.od_y. mRN As found m 
antibody-producing tissue. 

,n an op"on>l step, the "vers!., ° £ ^ " SUl "" 8 
Ubraries con bo increased by -.eans o£ r,»a.. » u "" 
gnosis. The 8 e„o libraries are introduce, inc. cultured 
host cells, which »ay be euaaryotic or proaaryoc.e, x» 
vhich chey ere expressed. Genes encoding encibodres of 
desired antigenic specificity ere identified, using a 
„ethod described herein or known techniques, isolated 
expressed in quantities in appropriate host cells. fro» 
»hich the encoded antibody can be purified. 

Specifically, e library of genes encoding. 
i„„uno 6 lobulin heavy chain regions and a library of genes 
encoding i.cun.globul in light cbain regions are con- 
strutted. This is cerri.d out by obtaining *<nil>°iy 
encoding DNA , which is either geno.ic fragment s or cDNAs 
of antibody inRNAs , e.plfying or cloning the fragments 
cDNAs ; and introducing then, into . standard framework 
antibody gene vector, which is used to introduce the 
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ant ibody - encoding DNA into cells in which the DNA is 
expressed. The vector includes a framework gene encoding 
a protein, such as a gene encoding an antibody heavy 
chain or an antibody light chain which can be of any 

05 origin (human, non-human) and can be derived from any of 
a number of existing DNAs encoding heavy chain immuno- 
globulins or light chain immunoglobulins. Such vectors 
are also a subject of the present invention and are 
described in greater detail in a subsequent section. 

10 Genes from one or both of the libraries are introduced 
into appropriate host cells, in which the genes are 
expressed, resulting in production of a wide variety of 
ant i gen- combining molecules. 

Genes encoding an t i gen - c omb in ing molecules of 

15 desired specificity are identified by identifying cells 
producing antigen- combining molecules which react with a 
selected antigen and then obtaining the genes of 
interest. The genes of interest can subsequently be 
introduced into an appropriate host cell (or can be 

20 further modified and then introduced into an appropriate 
host cell) for further production of antigen- combining 
molecules, which can be purified and used for the same 
purposes for which conventionally-produced antibodies are 
used . 

25 Through use of the method described, it is possible 

to produce antigen-combining molecules which are of wider 
diversity than are antibodies available as a result of 
known methods; novel ant i gen - c omb ining molecules with . a 
diverse range of specificities and affinities and 

30 ant igen- combining molecules which are predominantly human 
in origin. Such an t i gen - c omb in ing molecules are a 
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subject of the present invention and can be used 
clinically for diagnostic, therapeutic and prophylactic 
purposes, as well as in research contexts, and for other 
purpos e s . 

05 Brief_Descri£tion_of_the_Drawin£S 

Figure 1 is a schematic representation of the method 
of the present invention by which anti gen - comb ining 
molecules, or antibodies, are produced. 

Figure 2 is a schematic representation of amplifica- 
10 tion or cloning of IgM heavy chain variable region DNA 
from mRNA, using the polymerase chain reaction, 
Panel_A shows the relevant regions of the poly adenylated 
mRNA encoding the secreted form of the IgM heavy chain. 
S denotes the sequences encoding the signal peptide which 
15 causes the nascent peptide to cross the plasma membrane. 
V, D and J together comprise the variable region. C H 1, 
C H 2, and C R 3 are the three constant domains of C^- Hinge 
encodes the hinge region. C, B and Z are oligonucleotide 
PCR primers (discussed below). 

20 Panel_B shows the reverse transcript DNA product of the 
mRNA primed by oligonucleotide Z, with the addition of 
poly-dC by terminal transferase at the 3' end. 
Panel_C is a schematic representation of the annealing of 
primer A to the reverse transcript DNA. 

25 Panel_D shows the final double stranded DNA PCR product 
made utilizing primers A and B. 

Panel_E shows the product of PCR annealed to primer C. 
Panel_F is a blowup of Panel E, showing in greater detail 
.the structure of primer C. Primer C consists of two 
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parts: a 3' part complementary to IgM heavy chain mRNA 
as shown, and a 5' part which contains restriction site 
RE2 and spacer. 

Panel_G shows the final double stranded DNA PCR product 
made utilizing primers A and C and the product of the 
previous PCR (depicted in D) as template. The S, V, D, J 
regions are again depicted. 

Figure 3 is a schematic representation of the heavy 
chain framework vector pFHC. The circular plasmid 
(above) is depicted linearized (below) and its relevant 
components are shown: animal cell antibiotic resistance 
marker; bacterial replication origin; bacterial cell 
antibiotic. .resistance -marker.; Cji enhancer; LTR containing 
the viral promoter from the Moloney MLV retrovirus DNA; 
PCR primer (D) ; cDNA cloning site containing restriction 
endonuclease sites, RE1 and RE2 , separated by spacer DNA; 
C M exons; and poly A addition and termination sequences 
derived from the Cfi gene or having the same sequence as 
the C/i gene . 

Figure 4 depicts a nucleotide sequence of the C H 1 
exon of the Cn gene, and its encoded amino acid sequence 
(Panel A). The nucleotide coordinate numbers are listed 
above the line of nucleotide sequences. Panel B depicts 
the N-doped sequence, as defined in the text. 

The present invention provides a method of producing 
antigen-combining molecules (or antibodies) which does 
not require an in_vivo immunization procedure and which 
makes it possible to produce antigen- combining -molecules 
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vith far greater diversity than is shown by antibodies 
produced by currently - available techniques. 

The present invention relates to a method of 
producing libraries of genes encoding antigen- combining 
molecules (antibody proteins) with diverse 
antigen-combining specificities; a method of producing 
such antigen-combining molecules, antigen-combining 
molecules produced by the method and vectors useful in 
the method. The following is a description of generation 
of such libraries, of the present method of producing 
antigen-combining molecules of selected specificity and 
of vectors useful in producing antigen- combining 

molecules of the present- invention. 

As described below, the process makes use of 
techniques which are known to those of skill in the art 
and can be applied as described herein to produce and 
identify antigen-combining molecules of desired antigenic 
specificity: the polymerase chain reaction (PCR), to 
amplify and clone diverse cDHAs encoding antibody mRNAs 
found in antibody-producing tissue; mutagenesis protocols 
to further increase the diversity of these cDNAs ; gene 
transfer protocols to introduce antibody genes into 
cultured (prokaryotic and eukaryotic) cells for the 
purpose of expressing them; and screening protocols to 
detect genes encoding antibodies of the desired antigenic 
specificity. A general outline of the present method is 
represented in Figure 1. 
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Anti£en^Corabining_Molecules 

A key step in the production of antigen- combining 
molecules by the present method is the construction of a 
05 "library" of antibody genes which include "variable- 
regions encoding a large, diverse, but random set of 
specificities. The library can be of human or non-human 
origin and is constructed as follows: 

Initially, genomic DNA encoding antibodies or cDNAs 
10 of antibody mRNA (referred to as ant ibody - encoding DNA) 

is obtained. This DNA can be obtained from any source of 
antibody-producing cells, such as spleen cells, 
peripheral blood cells, lymph nodes., inflammatory tissue 
cells and bone marrow cells. It can also be obtained 
15 from a genomic library or cDNA library of B cells. The 
antibody-producing cells can be of human or non-human 
origin; genomic DNA or mRNA can be obtained directly from 
the tissue (i.e., without previous treatment to remove 
cells which do not produce antibody) or can be obtained 
20 after the tissue has been treated to increase 

concentration of antibody-producing cells or to select a 
particular type(s) of ant ibody - produc ing cells (i.e., 
treated to enrich the content of antibody-producing 
cells). Antibody-producing cells can be stimulated by an 
25 agent which stimulates antibody mRNA production (e.g., 
lipopolysaccharide) before DNA is obtained. 

Antibody-encoding DNA is amplified and cloned using 
a known technique, such as the PCR using appropriately - 
selected primers, in order to produce sufficient quanti- 
30 ties of the DNA and to modify the DNA in such a-manner 
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(e.g., by addition of appropriate restriction sites) that 
it can be introduced as an insert into an E^_coli cloning 
vector. This cloning vector can serve as the expression 
vector or the inserts can later be introduced into an 
05 expression vector, such as the framework antibody gene 

vector described below. Amplified and cloned DNA can be 
further diversified, using mutagenesis, such as PCR, in 
order to produce a greater diversity or wider repertoire 
of antigen-binding molecules, as well as novel antigen- 

10 binding molecules. 

Cloned an t ibody - enc oding DNA is introduced into an 
expression vector, such as the framework antibody gene 
vector of the present invention, which can be a plasmid, 
viral or retroviral vector. Cloned ant ibody - encoding DNA 

15 is inserted into the vector in such a manner that the 
cloned DNA will be expressed as protein in appropriate 
host cells. It is essential that the expression vector 
used make it possible for the DNA insert to be expressed 
as a protein in the host cell. One expression vector 

20 useful in the present method is referred to as the 

framework antibody gene vector. Vectors useful in the 
present method contain antibody constant region or 
portions thereof in such a manner that when amplified DNA 
is inserted, the vector expresses a chimeric gene product 

25 comprising a variable region and a constant region in 

proper register. The two regions present in the chimeric 
gene product can be from the same type of immunoglobulin 
molecule or from two different types of immunoglobulin 
molecules . 

30 These libraries of antibody- encoding genes, are then 

expressed in cultured cells, which can be eukaryotic or 
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prokaryotic. The libraries can be introduced into host 
cells separately or together. Introduction of the 
antibody - encoding DNA in_vitro into host cells (by 
infection, transformation or transf ec tion) is carried out 
05 using known techniques, such as elec troporat ion , 

protoplast fusion or calcium phosphate co-precipitation. 
If only one library is introduced into a host cell, the 
host cell will generally be one which makes the other 
antibody chain, thus making it possible to produce 
10 complete/functional antigen-binding molecules. For 

example, if a heavy chain library produced by the present 
method is introduced into host cells, the host cells will 
. .genjerally be. cultured ..cells, such as myeloma cells or IL 
coli, which naturally produce the other (i.e., light) 
15 chain of the immunoglobulin or are engineered to do so. 
Alternatively, both libraries can be introduced into 
appropriate host cells, either simultaneously or 
sequentially . 

Host cells in which the ant ibody - encod ing DNA is 
20 expressed can be eukaryotic or prokaryotic. They can be 
immortalized cultured animal cells, such as a myeloma 
cell line which has been shown to efficiently express and 
secrete introduced immunoglobulin genes (Morrison, S.L., 
e t _a 1 . , Ann^J^Y^Acad^Scr. , 507 :187 (1987); Kohler, G. 
25 and C. Mil stein, Eur J Immunol^ , 6:511 (1976); Oi, 
V . T . , e t_al . , I mmunogl obu 1 i n_G e ne_Ex£r ess ion_in 
Il^sfo rme d_Lym£ho i d^Ce 1 1 s , 80:825 (1983); Dans , A.C. 
and M.J. Shulman, 1 mmuno l^_To d ay , 10:119 (1989)). One 
host cell which can be used to express the antibody- 
30 encoding DNA is the J558L cell line or the SP2/0 cell 
line. 
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Cells expressing antigen-combining molecules with a 
desired specificity for a given antigen can then be 
selected by a variety of means, such as testing for 
reactivity with a selected antigen using nitrocellulose 
layering. The antibodies identified thereby can be of 
human origin, nonhuman origin or a combination of both. 
That is, all or some of the components (e.g., heavy 
chain, light chain, variable regions, constant regions) 
can be encoded by DNA of human or nonhuman origin, which, 
when expressed produces the encoded chimeric protein 
which, in turn, may be human, nonhuman or a combination 
of both. In such antigen- combining molecules, all or 
some of the .regions (e.g. . he_avy and light chain variable 
and constant regions) are referred to as being of human 
origin or of nonhuman origin, based on the source of the 
DNA encoding the ant igen - comb ining molecule region in 
question. For example, in the case in which DNA encoding 
mouse heavy chain variable region is expressed in host 
cells, the resulting antigen- combining molecule has a 
heavy chain variable region of mouse origin. Antibodies 
produced may be used for such purposes as drug delivery, 
tumor imaging and other therapeutic, diagnostic and 
prophylactic uses. 

Once antibodies of a desired binding specificity are 
obtained, their genes may be isolated and further 
mutagenized to create additional antigen combining 
diversity or antibodies of higher affinity for- antigen. 
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£D^^Il£^^£li£n_£f--5B££^£^_^!l£iS£!}^^i!l^iB£_ll£l££Hl££ 

The following is a detailed description of a 
specific experimental protocol which embodies the 
05 concepts described above. Although the following is a 
description of one particular embodiment, the same 
procedures can be used to produce libraries in which the 
immunoglobulin and the heavy chain class are different or 
in which light chain genes are amplified and cloned. The 
10 present invention is not intended to be limited to this 
example. In the embodiment presented below, a diverse 
heavy chain gene library is constructed. Using the 
prin.ciple.s_d escr ibed, in. r e latjLon to the heavy chain gene 
library, a diverse light chain gene library is also 
15 constructed. These are co-expressed in an immortal tumor 
cell capable of producing antibodies, such as plasma- 
cytoma cells or myeloma cells. Cells expressing antibody 
reactive to antigen are identified by a nitrocellulose 
filter overlay and antibody is prepared from cells 
20 identified as expressing it. As described in a subse- 
quent section, there are alternative methods of library 
construction, other expression systems which can be used, 
and alternative selection systems for identifying anti- 
body-producing cells or viruses. 
25 Step 1 in this specific protocol is construction of 

libraries of genes in E__coli which encode immunoglobulin 
heavy chains. This is followed by the use of random 
mutagenesis to increase the diversity of the library, 
which is an optional procedure. Step 2 is introduction 
30 of the library, by trans fee t ion , into myeloma c^ells. 
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Scep 3 is identification of myeloma cells expressing 
antibody with the desired specificity, using the 
nitrocellulose filter overlay technique or techniques 
known to those of skill in the art. Step 4 is isolation 
of the gene(s) encoding the antibody with the desired 
specificity and their expression in appropriate host 
cells, to produce antigen-combining fragments useful for 
a variety of purposes. 

One key step in construction of the library of cDNAs 
encoding the variable region of mouse heavy chain genes 
. is cons.truc.tion of an E^_coli plasmid vector , designated 
pFHC. pFHC contains a "framework" gene, which can be 
any antibody heavy chain and serves as a site into which 
the amplified cloned gene product (genomic DNA or cDNA of 
antibody mRNAs) is introduced. pFHC is useful as a 
vector for this purpose because it contains RE1 and RE2 
cloning sites. Other vectors which include a framework 
gene and other cloning sites can be used for this purpose 
as well. The framework gene includes a transcriptional 
promoter (e.g.. a powerful promoter, such as a Moloney 
LTR (Mulligan, R.C., In_Ex E er imental_Mani E Hl a t ion_o f _Gene 
Expression, New York Adacemic Press, p. 155 (1983)) and a 
C„ chain transcriptional enhancer to increase the level 
of transcriptions from the promoter (Gillies, S.D., et 
al., Call, 33:717 (1983), a cloning site containing RE1 
and RE2; part of the C M heavy chain gene encoding 
secreted protein; and poly A addition and termination 
sequ nces (Figure 3). The framework antibody g*ne vector 
of th present invention (pFHC) also includes a 
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selectable marker (e.g., an antibiotic resistance gene 
such as the neomycin resistance gene, neo ) for animal 
cells; sequences for bacterial replication (ori); and a 
selectable marker (e.g., the ampicillin resistance gene, 

05 Amp R ) for bacterial cells. The framework gene can be of 
any origin (human, non-human), and can derive from any 
one of a number of existing DNAs encoding heavy chain 
immunoglobulins (Tucker, P.W., e t_a 1 . , Science, 206:1299 
(1979); Honjo, T., e t _a 1 . , Cell, 18:559 (1979); Bothwell, 

10 A.L.M., et_al., Cell, 24:625 (1981); Liu, A.Y, et_al. , 
Gene , 5 4:33 (1987); Kavakami , T., e t_a 1 . , Nuc^^Acids. 
Res^, 8:3933 (1980)). In this embodiment, the vector 
retains the introns be men _ the ' C H 1 , hinge, C H _2_ and C R 3 
exons. The "variable region" of the gene, which includes 

15 the V, D and J regions of the antibody heavy chain and 
which encodes the antigen binding site, is deleted and 
replaced with two consecutive restriction endonuclease 
cloning sites, RE1 and RE2 . The restriction endonuclease 
site RE1 occurs just 3' to the LTR promoter and the 

20 restriction endonuclease site RE2 occurs within the 

constant region just 3 r to the J region (see Figure 3). 

Another key step in the production of antigen- 
combining molecules in this embodiment of the present 
invention is construction in an E^_coli vector of a 

25 library of cDNAs encoding the variable region of mouse 
immunoglobulin genes. In this embodiment, the pFHC 
vector, which includes cloning sites designated RE1 and 
RE2, is used for cloning heavy chain variable regions, 
although any cloning vector with cloning sites having the 

30 same or similar characteristics (described below) can be 
used. Similarly, a light chain vector can be designed, 
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using the above described procedures and procedures known 
to a person of ordinary skill in the art. 

In this embodiment, non-immune mouse spleens are 
used as the starting material. mRNA is prepared directly 
from the spleen or from spleen processed in such a manner 
that it is enriched for resting B cells. Enrichment of 
tissue results in a more uniform representation of 
antibody diversity in the starting materials. 
Lymphocytes can be purified from spleen using ficoll 
gradients (Boyum, A., S c and__ J f _C 1 in i c al_Inv e s , 
21:77 (1968)). B cells are separated from other cells 
(e.g., T cells) by panning with anti-lgM coated dishes 
(Wysocki, L,J. a-nd.-V.L-. Sato, Proc^_Na t L_Acad^_S ci., 
75:2844 (1978)). Because activated cells express the 
IL-2 receptor but resting B cells do not. resting B cells 
can be separated yet further from activated cells by 
panning. Further purification by size fractionation on a 
Cell Sorter results in a fairly homogeneous population of 

resting B cells. 

Poly A+ mRNA from total mouse spleen is prepared 
according to published methods (Sambrook, J., e t_al . . 

?l£l.®£Hi^S_£i£IiiS£i A_Laboratory_Manual , 2d Ed., Cold 

Spring~Harbor Laboratory Press, Cold Spring Harbor, NY 
(1989)). Production of antibody mRNA can first be 
stimulated by lipopolysaccharide (LPS) (Andersson, J. A., 
£t_al., J^_Ex Ei _Med^, 115:1511 (1977)). First strand 
cDN/Tis prepared to this mRNA population using as primer 
an oligonucleotide, Z, which is complementary to C^ i* 
the C H 1 region 3' to J . This primer is designated Z in 
Figure 2. First strand cDNA is then elongated-by the 
germinal transferase reaction with dCTP to form a poly dC 
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tail ( Sambrook , J., e t_a 1 . , Molecul ar_C 1 oningj_ A 

Labor a to ry_Manua 1 , 2d Ed., Cold Spring Harbor Laboratory 
Press, Cold Spring Harbor, NY (1989)). 

This DNA product is then used as template in a 

05 polymerase chain reaction (PCR) to amplify cDNAs encoding 
antibody variable regions (Saiki, R.K., e t _a 1 . , Science, 
23 9:487 (1988); Ohara, 0.. et_al. , Pr oc^_Na t l^_Ac a d_^_S c i^ 
USA. 86:5673 (1989)). Initially. PCR is carried out with 
two primers: primer A and primer B, as represented in 

10 Figure 2. Primer A contains the RE1 site at its 5' end, 
followed by poly dG . Primer B is complementary to the 
constant (C H D region of the Cm gene, 3' to the J region 
and _5.* to. primer Z (see Figure 2) . Primer B is 
complementary to all C>i genes, which encode the heavy 

15 chain of molecules of the IgM class, the Ig class 

expressed by all B cell clones prior to class switching 

(Schimizu, A. and T. Honjo, Cell, 36:801-803 (1984)) and 

present in resting B cells. The resultant PCR product 

includes a significant proportion of cDNAs encompassing 

20 the various V u regions expressed as IgM in the mouse. 

n 

(The use of other primers complementary to the cDNA genes 
encoding the constant regions of other immunoglobulin 
heavy chains can be used in parallel reactions to obtain 
the variable regions expressed on these molecules, but 

25 for simplicity these are not described). 

Next, the product of the first PCR procedure is used 
again for PCR with primer A and primer C. Primer C, like 
primer B, is complementary to the Cn gene 3' to J and 
just 5' to primer B (see Figure 2). Primer C contains 

30 the RE2 site at its 5' end. The RE2 sequence is chosen 
in such a manner that when it is incorporated into the 
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framework vector, no alteration of coding sequence of the 
Cp chain occurs (See Figures 2 and 3). This method of 
amplifying C/i cDNAs , referred to as unidirectional nested 
PCR, incorporates the idea of nested primers for cloning 
a gene when the nucleotide sequence of only one region of 
the gene is known (Ohara, 0., e t_al . . Pr oc._Na t l^_Ac ad^ 
Scj__USA, 86:5673 (1989)). The PCR product is then 
cleaved with restriction enzymes RE1 and RE2 and cloned 
into the RE1 and RE2 sites of the pFHC vector (described 
below). The sequence of primers and of RE1 and RE2 sites 
are selected so that when the PCR product is cloned into 
these sites, the sites are recreated and the cloned 
antibody gene fragments are brought back into the proper 
frame with respect to the framework immunoglobulin gene 
present in pFHC . This results in creation of a C M 
minigene which lacks the intron normally present between 
J and the C H 1 region of C M (See Figure 3). These 
procedures result in production of the heavy chain 
library used to produce antigen-binding molecules of the 
present invention, as described further below. 

Optionally, diversity of the heavy chain variable 
region is increased by random mutagenesis, using 
techniques known to those of skill in the art. 

For example, the library produced as described above 
is amplified again, using PCR under conditions of 
limiting nucleotide concentration, Such conditions are 
known to increase the infidelity of the polymerization 
and result in production of mutant products. Primers 
useful for this reaction are Primers C and D, as 
represented in Figures 2 and 3. Primer D derives from 
^FHC just 5' to RE1. The PCR product, after cleavage 
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with RE1 and RE2 , is recloned into the framework vector 

pFHC. To the extent that mutation affects codons of the 

antigen binding region, this procedure increases the 

diversity of the binding domains. For example, if the 

6 

starter library has a complexity of 10 elements, and an 
average of one mutation is introduced per complementarity 
determining region, and it is assumed that the 
complementarity determining region is 40 amino acids in 
size and that any of six amino acid substitutions can 
occur at a mutated codon, the diversity of the library 
can be increased by a factor of about 40 x 6, or 240, for 
single amino acid changes and 240 x 240, or about 
6 x 10 4 , for double amino acid changes, yielding a f inal 
diversity of approximately 10 11 . This is considered to 
be in the range of the diversity of antibodies which 
animals produce (Tonegawa, S., Nature, 302:575 (1983)). 
Even greater diversity can be generated by the random 
combination of H and L chains, the result of co-expres- 
sion in host cells (see below). It is, thus, theoreti- 
cally possible to generate a more diverse antibody 
library in_vitro than can be generated in_v i vo . This 
library of genes is called the "high diversity" heavy 
chain library. It may be propagated indefinitely in E_;_ 
coli. A high diversity light chain library can be 
prepared similarly. 

The framework vector for the light chain library, 
designated pFLC, includes components similar to those in 
the vector for the heavy chain library: the enhancer, 
promoter, a bacterial selectable marker, an animal 
selectable marker, bacterial origin of replication and 
light chain exons encoding the c nstant regions. For 
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pFLC, the animal selectable marker should differ from the 
animal selectable in pFHC. For example, if pFHC contains 
neo , pFLC can contain Eco gpt. 

A light chain library, which contains diverse light 
05 chain fragments, is prepared as described above for 

construction of the heavy chain library. In constructing 
the light chain library, the primers used are different 
from those described above for heavy chain library 
construction. In this instance, the primers are 
XO complementary to light chain mRNA encoding constant 

regions. The framework vector contains the light chain 
constant region exons . 



Introduc tion_of^he_Li^ 

15 The library of immunoglobulin chain genes produced 

as described is subsequently introduced into a line of 
immortalized cultured animal cells, referred to as the 
"host" cells, in which the genes in the library are 
expressed. Particularly useful for this purpose are 

20 plasmacytoma cell lines or myeloma cell lines which have 
been shown to efficiently express and secrete introduced 
immunoglobulin genes (Morrison, S.L., e t_a 1 . , Ann^_N/Jf^ 
Ac a d^_S c i^ , 507:187 (1987); Kohler, G. and C. Milstein, 
Eu r^_ J ^Immunol ^ f 6:511 (1976); Galfre and C. Milstein, 

25 M e th o d s_Enz ymo L , 7 3:3 (1981); Davis, A.C. and M.J. 

Shulman, I mmuno 1^_T o da v , 10:119 (1989)). For example, 
the J558L cell line can be co transf ec ted using electro- 
poration or protoplast fusion (Morrison, S.L., e t_a 1 . , 
Ann. N.Y. Acad. Sci., 507:187 ( 1987)) and trans-fected 
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cells selected on the basis of auxotrophic markers 
present on light and heavy chain libraries. 

As a result of cotransf ormation and selection for 
markers on both light chain and heavy chain vectors, most 
transformed host cells will express several copies of 
immunoglobulin heavy and light chains from the diverse 
library, and will express chimeric antibodies (antibodies 
encoded by all or part of two or more genes) (Nisonoff, 
A., et_al. , In_Th e_An t i b o d^_M £ 1 e c ul e , Academic Press, NY 
p. 238 (1975)). These chimeric antibodies are of two 
types: those in which one chain is encoded by a host 
cell gene and the other chain is encoded by an exogen- 
ously introduced antibody gene and those in which both 
the light and the heavy chain are encoded by an exogenous 
antibody gene. Both types of antibodies will be 
secreted. A library of cells producing antibodies of 
diverse specificities is produced as a result. The 
library of cells can be stored and maintained in- 
definitely by continuous culture and/or by freezing. A 
virtually unlimited number of cells can be obtained by 
this process. 

Isolation of Cells P r o due i ng_ An t i &e n^B i ^ 
Selec ted_S£ecif ic ity^ 

Cells producing antigen-binding molecules of 
selected specificity (i.e., which bind to a selected 
antigen) can be identified and isolated using 
nitrocellulose filter layering or known techniques. The 
same methods employed to identify and isolate hybridoma 
cells producing a desired antibody can be used: - cells 
are pooled and the supernatants tested for reactivity 
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with antigen (Harlow, E. and D. Lane, Antibodies^. — A 
Laborator2_Manual , Cold Spring Harbor Laboratory, N.Y., 
p. 283 (1988). Subsequently, individual clones of cells 
are identified, using known techniques. A preferred 
05 method for identification and isolation of cells makes 
use of nitrocellulose filter overlays, which allow the 
screening of a large number of cells. Cells from the 
library of transfected myeloma cells are seeded in 10 cm 
petri dishes in soft agar (Cook, W.D. and M.D. Scharff, 
10 PNAi . 74:5687 (1977); Paige, C.J., e t _a 1 . , Methods_in 
Enzymol^, 150:257 (1987)) at a density of 10 colony 
forming units, and allowed to form small colonies 
(approximately 300 cells). A large number of dishes 
(>100) may be so seeded. Cells are then overlayed with a 
15 thin film of agarose (<lmm) and the agarose is allowed to 
harden. The agarose contains culture medium without 
serum. Nitrocellulose filters (or other protein-binding 
filters) are layered on top of the agarose, and the 
dishes are incubated overnight. During this time, 
20 antibodies secreted by the cells will diffuse through the 
agarose and adhere to the nitrocellulose filters. The 
nitrocellulose filters are keyed to the underlying plate 
and removed for processing. 

The method for processing nitrocellulose filters is 
25 identical to the methods used for Western blotting 

(Harlow. E. and D. Lane, Antibodies^ Lab o r a to r v_Manua 1 , 

Cold Spring Harbor. N.Y., p. 283 (1988)). The antibody 
molecules are adsorbed to the nitrocellulose filter. The 
filters, as prepared above, are. then blocked. The 
30 desired antigen, for example, keyhole lymphet hemocyanin 

12 5- 

(KLH) , which has been iodinated with radioactive I, is 
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then applied in Western blotting buffers to the filters. 
(Other, non radiographic methods can be used for 
detection). After incubation, the filters are washed and 
dried and used to expose autoradiography film according 
to standard procedures. Where the filters have adsorbed 
antibody molecules which are capable of binding KLH , the 
autoradiography film will be exposed. Cells expressing 
the KLH reactive antibody can be identified by 
determining the location on the dish corresponding to an 
exposed filter; cells identified in this manner can be 
isolated using known techniques. Cells which are 
isolated from a region of the dish can then be 
rescreened, to insure the isolation of the clone of 
antigen-binding molecule -producing cells. 

Isolation_of_Genes_Encoding^ 

Selected_Sp.ec if i £ i ty_and_Pur i f ic a t ion_o f _Encode d 
^Ili igen^Binding_Molecules 

The gene(s) encoding an antigen-binding molecule of 
selected specificity can be isolated. This can be 
carried out, for example, as follows: primers D and C 
(see Figures 2 and 3) are used in a polymerase chain 
reaction, to produce all the heavy chain variable region 
genes introduced into the candidate host cell from the 
library. These genes are cloned again in the framework 
vector pFHC at the RE1 and RE2 sites. Similarly, all the 
light chain regions introduced into the host cell from 
the library are cloned into the light chain vector, pFLC. 
Members of the family of vectors so obtained are then 
transformed pairwise into myeloma cells, which-are tested 
for the ability to produce and secrete the antibody with 
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the desired selectivity. Purification of the antibody 
from these cells can then be accomplished using standard 
procedures (Johnstone, A. and R. Thorpe, I mniunoch em^_.i n 
Practice, Blackwell Scientific, Oxford, p. 27 (1982); 

05 Harlow, E. and D. Lane, Antibodies^ A_Lab o r a t o ry_Manua 1 , 

Cold Spring Harbor Laboratory, N.Y., p. 283 (1988)). 

Alterat ion_of_Af f ini^ 

It is also possible to produce an t i gen - b inding 
molecules whose affinity for a selected antigen is 
10 altered (e.g., different from the affinity of a 

corresponding antigen-binding molecule produced by the 
.present .method.).... This can be carried out, for example, 
to increase the affinity of an ant i gen - b ind ing molecule 
by randomly mutagenizing the genes isolated as described 
15 above using previously- described mutagenesis methods. 
Alternatively, the variable region of antigen-binding 
molecule - encoding genes can be sequenced and site 
directed mutagenesis performed to mutate the comple- 
mentarity determining regions (CDR) (Kabat, E.A., 
20 l22HS£ii." i£I*- s 25- 36 (1988)). Both processes result in 
production of a sublibrary of genes which can be screened 
for antigen-binding molecules of higher affinity or of 
altered affinity after the genes are expressed in myeloma 
cells. 

25 Al ternat ive_Mater ial s_a 
Zl£l£!2i_Me thod 

In addition to those described above for use in the 
method of the present invention, other materials (e.g., 
starting materials, primers) and procedures can be used 
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in carrying out the method. For example, use of PCR 
technology to clone a large collection of cDNA genes 
encoding variable regions of heavy chains has been 
described above. Although primers from the class were 

05 described as being used in unidirectional nested PCR, the 
present invention is not limited to these conditions. 
For example, primers from any of the other heavy chain 

classes (C73, C 7i , ^ 2 h' ° Q f ° T exam ? le) or from li S ht 
chains can be used. Cp was described as of particular 

10 use because of the fact that the entire repertoire of 

heavy chain variable regions are initially expressed as 
IgM. Only following heavy-chain class switching are 
these variable regions expressed with a heavy chain of a 
different class (Shimizu, A. and T. Honjo, Cell, 

15 36:801-803 (1984)). In addition, the predominant 
population of B cells in nonimmune spleen cells is 
IgM + -cells (Cooper, M.D. and P. Burrows, In 
Immunoglobulin^Genes , Academic Press, N.Y. p. 1 (1989)). 
Although unidirectional nested PCR amplification is 

20 described above, other PCR procedures, as well as other 
DNA amplification techniques can be used to amplify DNA 
as needed in the present invention. For example, 
bidirectional PCR amplification of antibody variable 
regions can be carried out. This approach requires use 

25 of multiple degenerate 5' primers (Orlandi, R., £t_al., 
Pr o c^_Na t l^^Ac ad^^S c i^^U S A , 86:3833 (1989); Sastry, L. , 
e t_a 1 . , P r o c^N a t l^^Ac a d^S c i ^^U S A , 86 : 5728 (1989)). 
Bidirectional amplification may not pick up the same full 
diversity of genes as can be expected from unidirectional 

30 PCR. 
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In addition, methods of introducing further 
diversity into the antibody library other than the method 
for random mutagenesis utilizing PCR described above can 
be used. Other methods of random mutagenesis, such as 
05 that described by Sambrook, e t _a 1 . (Sambrook, J., e t_a 1 . , 

Mol ecular_Clonin£j_ A_Lab o r a t o r y_M anua 1 , Cold Spring 

Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989)) 
can be used, as can direct mutagenesis of the comple- 
mentarity determining regions (CDRs). 
10 Framework vectors other than one using a mouse Cm 

heavy chain constant region, which contains the C^ 
enhancer and introns and a viral promoter (described 
previous ly) can be. used for inserting the products of 
PCR. The vectors described were chosen for their 
15 subsequent use in the expression of the antibody genes, 

but any eukaryotic or prokaryotic cloning vector could be 
used to create a library of diverse cDNA genes encoding 
variable regions of antibody molecules. The inserts from 
this vector could be transferred to any number of 
20 expression vectors. For example, other framework vectors 
which include intronless genes can be constructed, as can 
other heavy chain constant regions. In addition to 
plasmid vectors, viral vectors or retroviral vectors can 
be used to introduce genes into myeloma cells. 
25 The source for antibody molecule mRNAs can also be 

varied. Purified resting B lymphocytes from mouse 
noniramunized spleen are described above as such a source. 
However, total spleens (immunized or not) from other 
animals, including humans, can be used, as can any source 
30 of antibody-producing cells (e.g., p ripheral b-lood, 
lymph nodes, inflammatory tissue, bon marrow). 

/ 
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Introduction of H and L chain gene DNA into myeloma 
cells using co trans forma t ion by elec troporation or 
protoplast fusion methods is described above (Morrison, 
S.L. and V.T. Oi . A dv^I nmuno L f ft!t :(>5 (1989)). However, 
05 any means by which DNA can be introduced into living 
cells in_vivo can be used, provided that it does not 
significantly interfere with the ability of the 
transformed cells to express the introduced DNA. In 
fact, a method other than co transformation , can be used. 
10 Cotransf ection was chosen for its simplicity, and because 
both the H and L chains can be introduced into myeloma 
cells. It may be possible to introduce only the H chain 
into myeloma cells. Moreover , the H chain itself in many 
cases carries sufficient binding affinity for antigen. 
15 However, other methods can also be used. For example, 
retroviral infection may be used. Replication-incompe- 
tent retroviral vectors can be readily constructed which 
can be packaged into infective particles by helper cells 
(Mann, R., e t_a 1 . , Cell, 33:153-159 (1903)). Viral 
20 titers of 10 5 infectious units per ml. can be achieved, 
making possible the transfer of very large numbers of 
genes, into myeloma cells. 

Further increases in the diversity of antibody- 
producing cells than results from the method described 
25 above can be generated if light and heavy chain genes are 
introduced separately into myeloma cells. Light chain 
genes can be introduced into one set of myeloma cells 
with one selectable marker, and heavy chains into another 
set of cells with a different selectable marker. Myeloma 
30 cells containing and expressing both H and L chains could 
then be generated by the highly efficient process of 
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polyethylene glycol mediated cell fusion (Pontecorvo, G., 
S on a t i c_C e 1 l_Ge ne t i c s , 1:397 (1975)). Thus, a method of 
screening diverse libraries of antibody genes using 
animal cells is not United by the number of cells which 
05 can be generated, but by the number of cells which can be 
screened . 

Methods of identifying antigen-binding molecule- 
expressing cells expressing an ant i gen - b inding molecule 
of selected specificity other than the nitrocellulose 

10 filter overlay technique described above can be used. An 
important characteristic of any method is that it be 
useful to screen large numbers of different antibodies. 
-With the nitrocellulose, filter overlay technique, for 
example, if 300 dishes are prepared and 10 independent 

15 transformed host cells per dish are screened, and if. on 
average, each cell produces ten different antibody 
molecules, then 300 x 10* x 3, or about 10 7 different 
antibodies can be screened at once. However, if the 
antibody molecules can be displayed on the cell surface, 

20 still larger numbers of cells can be screened using 
affinity matrices to pre-enrich for antigen-binding 
cells. There are immortal B cell lines, such as BCL^, 
which will express IgM both on the cell surface and as a 
secreted form (Granowicz, E.S., £t_al. , J^_I mmun o 1^ , 

25 125:976 (1980)). If such celis are infected by 

retroviral vectors containing the terminal Cm exons , the 
infected cells will likely produce both secreted and 
membrane bond forms of IgM (Webb, C.F., e t_a 1 . , L 
Immunol^. 14 3:3934-3939 (1989)). Still other methods can 

30 be used t detect antibody production. If the-host cell 
is E. coli, a nitrocellulose overlay is possible, and 
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such methods have been frequently used to detect E^coli 
producing particular proteins (Sambrook, J., et^al. , 

Mol ecular_Cl oningj^ *b£I al£IX_!l*BH • Cold Spring 

Harbor Laboratory Press, Cold Spring Harbor, N.Y. 
05 (1989)). Other methods of detection are possible and one 
in particular, which involves the concept of "viral 
coating" , is discussed below. 

Viral coating can be used as a means of identifying 
viruses encoding antigen- combining molecules. In this 
10 method, a viral vector is used to direct the synthesis of 
diverse antibody molecules. Upon lytic infection of host 
cells, and subsequent cell lysis, the virus becomes 
"coated" with _the antibody product it directs. That is, 
the antibody molecule becomes physically linked to the 
15 outside of a mature virus particle, which can direct its 
synthesis. Methods for viral coating are described 
below. Viruses coated by antibody can be physically 
selected on the basis of their affinity to antigen which 
is attached to a solid support. The number of particles 
20 which can be screened using this approach is well in 
excess of 10 9 and it is possible that 10 11 different 
antibody genes could be screened in this manner. In one 
embodiment, an affinity matrix containing antigen used to 
purify those viruses encoding antibody molecules with 
25 .affinity to antigen and which* coat the surface of the 
virus which encodes those antibodies is used. 

One method of viral coating is as follows-: A 
diverse library of bacteriophage X encoding parts of 
antibody molecules that are expressed in infected E^_coli 
30 and which retain the ability to bind antigens is created, 
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using known techniques (Orlandi, R.. e t_a 1 . , Proc._Natl. 
A£ ad^S c i^_US A , 8 6:3833 (1989); Huse, W.D., et_al., 
Science. 246:1275 (1989); Better. M.. e t_a 1 . . Science. 
240:1041 (1988); Skerra, A. and A. Pluckthon. Science. 
05 240:1038 (1988)). Bacteria infected with phage are 

Imbedded in a thin film of semisolid agar. Greater than 
infected bacteria may be plated in the presence of an 



10 



15 



20 



25 



10 



30 



excess of uninfected bacteria in a volume of 1 ml of agar 
and spread over a 10 cm 2 surface. The agar contains 
monovalent antibody "A" (Parham. P.. In_Handbook_o f 

Ex£ejrimenta^_Im^u^ologyi Immunochem^ , Blackwell 

Scientific Publishers, Cambridge, MA, pp. 14.1-14.23 
(19-8-6 ) ) , which can. bind, the A coat proteins and which has 
been chemically coupled to monovalent antibody "B". which 
can bind an epitope on all viral directed antibody 
molecules. Monovalent antibodies are used to prevent the 
crosslinking of viral particles. Upon lytic burst, 
progeny phage particles become effectively cross linked 
to the antibody molecule they encode. Because lysis 
occurs in semisolid medium, in which diffusion is slow, 
cross linking between a given phage and the antibody 
encoded by another phage is minimized. A nitrocellulose 
filter (or other protein binding filter) is prepared as 
an affinity matrix by adsorbing the desired antigen. The 
filter is then blocked so that no other proteins bind 
nonspecifically. The filter is overlayed upon the agar, 
and coated phage are allowed to bind to the antigen by 
way of their adherent antibody molecules. Filters are 
washed to remove nonspecifically bound phage. 
Specifically bound phage th refore represent phage 
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encoding antibodies with the desired specificity. These 
can now be propagated by reinfection of bacteria. 

Thus the present invention makes it possible to 
produce antigen-binding molecules which, like antibodies 
05 produced by presently- available techniques, bind to a 
selected antigen (i.e., having binding specifity). 
Antibodies produced as described can be used, for 
example, to detect and neutralize antigens and deliver 
molecules to antigenic sites. 

10 EXAMPLE_I Am£lif ication_of_IgM He avv_Cha invariable 

I gM "heavy chain variable DNA is amp-lified f-rom mRNA 
by the procedure represented schematically in Figure 2. 
In Figure 2, Panel A depicts the relevant regions of the 

15 poly adenylated mRNA encoding the secreted form of the 
IgM heavy chain. In Panel A, S denotes the sequences 
encoding the signal peptide which causes the nascent 
peptide to cross the plasma membrane, a necessary step in 
the processing and secretion of the antibody. V, D and J 

20 derive from separate exons and together comprise the 

variable region. C R 1 , C H 2, and C R 3 are the three constant 
domains of Cm- "Hinge" encodes the hinge region. C, B 
and Z are oligonucleotide PCR primers used in the 
amplification process. The only constraints on Primers B 

25 and Z are that they are complementary to the mRNA, and 
occur in the order shown relative to C. Primer C. in 
addition to being complementary to mRNA, has an extra bit 
of sequence at its 5 ' end which allows the cloning of its 
PCR product. This is described below. Panel B'depicts 

30 the reverse transcript DNA product of the mRNA primed by 
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oligonucleotide Z, with the addition of poly-dC by 
terminal transferase at the 3' end of the product. Panel 
C depicts the annealing of primer A to the reverse 
transcript DNA represented in Panel B . Primer A contains 
05 the restriction endonuclease site RE1 , with additional 
DNA at its 5' end. The constraints on the RE1 site are 
described in Example 2. Panel D depicts the final double 
stranded DNA PGR product made utilizing primers A and B. 
Panel E depicts the PCR product shown in Panel D annealed 
10 to Primer C. Panel F is a blow up of panel E showing the 
structure of primer C. Primer C consists of two parts: 
a 3' part complementary to IgM heavy chain mRNA as shown, 
and a 5 ' part which contains re s-t-r-ict ion-site RE2 and 
spacer. Constraints on RE2 are described in Example 2. 
15 Panel G depicts the final double stranded DNA PCR product 
utilizing Primers A and C and the product of the previous 
PCR (depicted in Panel D) as template. The S, V, D , J 
regions are again depicted. 

EXAMPLE_2 Cons true t ion_o f _Heavy_Cha in_Fr^ 

20 £FHC 

A heavy chain framework vector, designated pFHC, is 
constructed, using known techniques (See Figure 3). It 
is useful for introducing ant ibody - encoding DNA into host 
cells, in which the DNA is expressed, resulting in 

25 antibody production. The circular plasmid (above) is 

depicted linearized (below) and its relevant components 
are shown. The neomycin antibiotic resistance gene 
(neo R ) is useful for selecting transformed animal cells 
(Sambrook, J., e t_a 1 . , Mol ecul ar^C Ion ingi A_La"boratorj 

30 Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold 
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Spring Harbor, NY (1989)). The bacterial replication 
origin and ampicillin antibiotic resistance genes, useful 
respectively, for replication in E^_coli and rendering E^ 
coli resistant to ampicillin, can derive from any number 
05 of bacterial plasmids, including PBR322 (Sambrook, J., et 

al . , Molecular_Cloni.n£2 £_Lab or a t o r Y_Manu a 1 , 2d Ed., 

Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY (1989)). The C^ enhancer, which derives from the 
intron between exons J and C R 1 of the Cm gene, derives 
10 from any one of the cloned C» genes (Kawakami, T., et 
al . , Nu c lei £_Acids_Re se arch , 8:3933 (1980); Honj o , T. . 
^n. Jev^Immunol., 1:499 (1983)) and increases levels of 

transcript-ion from antibody genes L.T.R. contains the 

viral promoter from the Moloney MLV retrovirus DNA 
15 (Mulligan, R.C. , Ex2illnil!lIiI-ll^!li£Hliii£B-£f-2£Il£ 

Expression, New York Academic Press, p. 155 (1983)). 
D represents the PCR primer described in the text, 
depicted in its 5' to 3' orientation. The only con- 
straints on D are its orientation, its complementarity to 
20 pFHC and its order relative to the RE1 and RE2 cloning 
sites. Preferably, D is within 100 nucleotides of RE1 . 
The cDNA cloning site contains restriction endonuclease 
sites RE1 and RE2 , separated by spacer DNA which allows > 
their efficient cleavage. The constraints on RE1 and RE2 
25 are described below. The C/i exons, as described in the 
text and literature, direct the synthesis of IgM heavy 
chain. Only part of C H 1 is present, as described below. 
C H 3 is chosen to contain the C/is region which specifies a 
secreted form of the heavy chain ((Kawakami, T., et_al., 
30 Nucleicjcidsjesearch, 8:3933 (1980); Hon jo, T. , Ann. 
£e v^_I mmuno 1 ^ , 1:499 (1983)). Finally, pFHC contains 
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poly A addition and termination sequences which can be 
derived from the Cm gene itself (Honjo, T., Ann^Rev^, 
Immunol^. 1:499 (1983); Kawakami, T . , e t_a 1 . , Nucleic 
A£ i ds_Re se ar ch , 8:3933 (1980)). One potential advantage 
05 of using the entire gene is that in some host cell 

systems, a membrane bound and secreted form of IgM may be 
expressed (Granowicz, E.S., et_al. , ^.Immunol , 125:976 
(1980)) . 

The plasmid can be produced by combining the 
10 individual components, or nucleic acid segments, depicted 
in Figure. 3, using PCR cassett assembly (See below). 
Because the entire nucleotide sequence of each component 
is defined, the entire -nueleo tide .sequence of the plasma 
is defined. 

15 The constraints on RE1 are simple. It should be the 

sole cleavage site on the plasmid for its restriction 
endonuclease. The choice of RE1 can be made by computer 
based sequence analysis ( In te 1 1 i gene t ic s Suite, Release 
5:35, Intelligenetics) . 

20 The constraints on RE2 are more complex. First, it 

must be the sole cleavage site on the plasmid for its 
restriction endonuclease, as described for RE1. 
Moreover, the RE2 site must be such that when the PCR 
product is inserted, a gene is thereby created which is 

25 capable of directing the synthesis of a complete IgM 

heavy chain. This limits the choices for RE2 , but the 
choices available can be determined by computer based 
sequence analysis. The choices can be determined as 
follows. First, a list of restriction endonuc leas e s that 

30 do not cleave pFHC is compiled (see Table 1). 
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TABLE 1 





AacII 


Ahall 


Asel 




Avrll 


Bgll 


BspHI 


05 


BssHII 


BstBI 


Clal 




Dral 


EagI 


EcoRI 




EcoRV 


Fspl 


Hgal 




Hindi 


Hpal 


Kpnl 




Mlul 


Nael 


Narl 


10 - 


Nde I 


. . Not! ... 


. Nrul 




PaeR7I 


Pvul 


RsrII 




SacII 


Sail 


Seal 




Sfll 


SnaBI 


Spel 




SphI 


Sspl 


StuI 


15 


Tchllll 


Xbal 


Xhol 




These are called the 


"rare non-cutters." 


Next , 



sequence of C H 1 is rewritten with "N" at the third 
position of each codon and entered into the computer. 
This is called the "N-doped sequence" (See Figure 4). 

20 Next, the rare non-cutters are surveyed by computer 
analysis for those which will cleave the N-doped 
sequence. The search program will show a possible 
restriction endonuclease site, assuming a match between N 
and the restriction endonuclease cutting site. For 

25 example, with 39 rare non-cutters. 22 will cleave the 
N-doped sequence of On C^l , many of them several times 
/see Table 2). In this table. "Def" means a definite cut 
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site, of which there are none, because of the Ns. "Pos M 
means a possible cleavage site at the indicated nucleo- 
tide position if N is chosen appropriately. w Y n 
indicates any pyrimidine, w R n indicates any purine and 
"N" indicates any nucleotide. The nucleotide positions 
refer to coordinates represented in Figure 4. 
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TABLE 2 







ENZYME 


RECOGNITION 


CUT 


SITE 








Aatll 


(GACGTC) 


Def : 
Pos 


none 
250 


309 


05 




Ahall 


(GRCGYC) 


Def : 
Pos : 


none 
247 


306 






Avrll 


(CCTAGG) 


Def : 
Pos : 


none 
204 




10 




BspHI 


(TCATGA) 


Def : 
Pos 


none 
138 






Bsshll 


(GCGCGC) 


De f : 
Pos : 


none 
189 








EcoRI 


(GAATTC) 


Def : 
Pos : 


none 
195 


334 


15 




EcoRV 


(GATATC) 


Def : 
Pos 


none 
214 




20 




Hgal 

Hindi 


(GACGCNNNNN) 

( NNN NNNNN N N GCGTC ) 

(GTYRAC) 


Def : 
Pos : 
Def : 
Pos 


none 
2 84 

none 
183 


220 






Hpal 


(GTTAAC) 


Def 
Pos 


none 
220 








Kpnl 


(GGTACC) 


Def 
Pos 


none 
: 408 




25 




Nrul 


(TCGCGA) 


Def 
Pos 


none 
: 174 


193 






PaeR7 


(CTCGAG) 


Def 
Pos 


: none 
: 190 


339 


30 




Pvul 


(CGATCG) 


Def 
Pos 


: none 
: 178 






Seal 


(AGTACT) 


Def 
Pos 


: none 
: 209 


266 






Spel 


(ACTAGT) 


Def 
Pos 


none 
: 131 


167 


35 




SphI 


(GCATGC) 


Def 
Pos 


; none 
: 338 








Sspl 


(AATATT) 


Def 
Pos 


: none 
: 371 




40 




StuI 
Tthllll 


(AGGCCT) 
(GACNNNGTC) 


Def 
Pos 
Def 
Pos 


none 
: 149 
: none 
: 212 








Xbal 


(TCTAGA) 


Def 
Pos 


: none 
: 338 




45 




Xhol 


(CTCGAG) 


Def 
Pos 


: none 
190 


339 



303 



284 



359 
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- 38- 

Most of these cleavage sites (about 60%) are compatible 

with the amino acids specified by C H 1 . Therefore, it is 

possible to mutate C H 1 to create a unique site for such 

an enzyme without altering the amino acid sequence 

incoded by C u l. One sequence which illustrates this is 
n 

shown below: 

1) ...ala met gly cys leu ala arg asp... 

2 ) ...GCC ATG GGC TGC CTA GCC CGG GAC... 

3) ...GCC ATG GGC TGC CTA GCG CGC GAC... 



BssHIl 



Line 1 represents part of the actual amino acid 
sequence specified by the mouse Cn C R 1 gene region, and 
line 2 is the actual nucleotide sequence . By changing 
the sequence to the indicated nucleotides underlined on 
line 3, a cleavage site for the rare non-cutter BssHIl is 
created. The new sequence (containing the BssHIl site) 
GCG CGC still encodes the identical amino acid sequence. 
Therefore, the sequence of the primer C is chosen to be 
the complement of line 3, and RE2 is the BssHIl site. 
Such a primer will function in the PCR and vector 
construction as desired. Other examples are possible, 
and the same process can be used in designing vectors and 
primers for cloning light chain variable regions. 

The choice for primer C puts a constraint on pFHC. 
In the example shown, the C H 1 region contained -on pFHC 
must begin at its 5' nd with the mutant sequence GCG 
CGC. Such mutant fragments can b readily made by the 
process of PCR cassett assembly described below. 
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The process of PCR cassette assembly is a method of 
constructing plasmid molecules (in this case the plasmid 
pF HC) from fragments of DNA of known nucleotide sequence. 
One first compiles a list of restriction endonucleases 
that do not cleave any of the fragments. Each fragment 
is then individually PCR amplified using synthesized 
oligonucleotide primers complementary to the terminal 
sequences of the fragment. These primers are synthesized 
co contain on their 5' ends restriction endonuclease 
cleavage sites from the compiled list. Thus, each PCR 
product can be so designed that each fragment can be 
assembled one by one into a larger plasmid structure by 

cleavage and ligation and transformation into E_c o 1 i 

Using this method, it is also possible to make minor 
.edifications to modify the terminal sequence of the 
fragment being amplified. This is done by altering the 
PCR primer slightly so that a mismatch occurs. In this 
way it is possible to amplify the C M gene starting 

a „ n < nf in C 1 (as determined by 
precisely from the desired point in i^i v 

pp? endonuclease cleavage 
oligo C above) and creating the REZ enoonuc 



25 



site 



EQUIVALENTS 

" Those skilled in the art will recognize, or be able 
to ascertain using no more than routine experimentation, 
na ny equivalents to the specific embodiments of the 
invention described specifically herein. Such 
equivalents are intended to be encompassed in the scope 
of the following claims. 
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